A Stochastic Speech Model Supporting W-Disjoint Orthogonality
نویسندگان
چکیده
In previous work, we have successfully used an ideal joint sparseness assumption: W-Disjoint Orthogonality (WDO). This assumption, that the time-frequency representations of the sources have disjoint support, is satisfied in an approximate sense by many signals of practical interest, including speech. Here we discuss results derived from a stochastic model of speech signals that justify the WDO hypothesis. If the magnitude of the timefrequency components of the source signals have Laplacian priors, a subset of their maximum á posteriori (MAP) estimates are guaranteed to satisfy the WDO assumption.
منابع مشابه
Blind Source Separation of Speech Mixtures using a Simple and Computationally Efficient Time-Frequency Approach
A very simple and extremely computationally efficient algorithm for blind separation of two speech sources from two mixtures is presented in this paper. The algorithm exploits the approximate W-disjoint orthogonality of speech signals and assumes specific sensors (microphones) setting that allows the sources to possess a feature we call cross high-low diversity. Two sources are said to be cross...
متن کامل8 The DUET Blind Source Separation
This chapter presents a tutorial on the DUET Blind Source Separation method which can separate any number of sources using only two mixtures. The method is valid when sources are W-disjoint orthogonal, that is, when the supports of the windowed Fourier transform of the signals in the mixture are disjoint. For anechoic mixtures of attenuated and delayed sources, the method allows one to estimate...
متن کاملOn the Window-disjoint-orthogonality of Speech Sources in Reverberant Humanoid Scenarios
Many speech source separation approaches are based on the assumption of orthogonality of speech sources in the time-frequency domain. The target speech source is demixed from the mixture by applying the ideal binary mask to the mixture. The time-frequency orthogonality of speech sources is investigated in detail only for anechoic and artificially mixed speech mixtures. This paper evaluates how ...
متن کاملThe optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio.
In this paper, a computational goal for a monaural speech separation system is proposed. Since this goal is derived by maximizing the signal-to-noise ratio (SNR), it is called the optimal ratio mask (ORM). Under the approximate W-Disjoint Orthogonality assumption which almost always holds due to the sparse nature of speech, theoretical analysis shows that the ORM can improve the SNR about 10log...
متن کاملHollywood Post-9/11 Alien Films: Recontextualization of George. W. Bush's Discourse
The widely shocking attacks on the World Trade Center and the Pentagon on September 11 was interpreted differently by various institutions worldwide, ranging from America's legitimate motive to begin a 'war on terror' to defend her 'very freedom', to a pretext for Bush administration to pursue G. H. W. Bush's temptation for 'a new world order'. Various institutions, therefore, made use of their...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003